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1 CCTGCCTGGT CCTCTGTGCC 
51 AGGAGCCCAT TTGGTAGTGA 
101 CCTGGCCGTG ATAGTGGCCA 
.151 GGCGCCAACG CTGGGCTGCA 
201 GGGCTGGGCA ACCTGCTGCA 
2 51 CGACCAGTTG CGGCGCCGCT 
301 GGACGCCGGT GGTCGTGCTC 
351 GTGACCCACG GCGAGGACAC 
4 01 GATCCTGGGT TTTGGGCCGC 
4 51 GTCTCTTGGA CAAAGCCGTG 
501 CGCCGCTTCG AGTACGACGA 
551 TCAGGAGGGA CTGAAGGAGG 
601 CTGTCCCCGT CCTCCTGCAT 
651 TTCCAAAAGG CTTTCCTGAC 
701 GATGACCTGG GACCCAGCCC 
751 TGGCAGAGAT GGAGAAGGCC 
801 GAGAACCTGC GCATAGTGGT 
851 CACCTCGACC ACGCTGGCCT 
901 ATGTGCAGCG CCGTGTCCAA 
951 CGGCGACCAG AGATGGGTGA 
1001 GATTCATGAG GTGCAGCGCT 
1051 ATATGACATC CCGTGACATC 
1101 ACGACACTCA TCACCAACCT 
1151 GGAGAAGCCC TTCCGCTTCC 
1201 ACTTTGTGAA GCCGGAGGCC 
1251 TGCCTCGGGG AGCCCCTGGC 
1301 CCTGCTGCAG CACTTCAGCT 
1351 GCCACCATGG TGTCTTTGCT 
14 01 TGTGCTGTGC CCCGCTAGAA 
14 51 GCCAGAGGCT CTAATGTACA 
1501 AAAAAAAAAA AAAAAAAAAA 
(SEQ ID NO: 1) 



TGGTGGGGTG GGGGTGCCAG GTGTGTCCAG 
GGCAGGTATG GGGCTAGAAG CACTGGTGCC 
TCTTCCTGCT CCTGGTGGAC CTGATGCACC 
CGCTACTCAC CAGGCCCCCT GCCACTGCCC 
TGTGGACTTC CAGAACACAC CATACTGCTT 
TCGGGGACGT GTTCAGCCTG CAGCTGGCCT 
AATGGGCTGG CGGCCGTGCG CGAGGCGCTG 
CGCCGACCGC CCGCCTGTGC CCATCACCCA 
GTTCCCAAGG ACGCCCCTTT CGCCCCAACG 
AGCAACGTGA TCGCCTCCCT CACCTGCGGG 
CCCTCGCTTC CTCAGGCTGC TGGACCTAGC 
AGTCGGGCTT TCTGCGCGAG GTGCTGAATG 
ATCCCAGCGC TGGCTGGCAA GGTCCTACGC 
CCAGCTGGAT GAGCTGCTAA CTGAGCACAG 
AGCCCCCCCG AGACCTGACT GAGGCCTTCC 
AAGGGGAACC CTGAGAGCAG CTTCAATGAT 
GGCTGACCTG TTCTCTGCCG GGATGGTGAC 
GGGGCCTCCT GCTCATGATC CTACATCCGG 
CAGGAGATCG ACGACGTGAT AGGGCAGGTG 
CCAGGCTCAC ATGCCCTACA CCACTGCCGT 
TTGGGGACAT CGTCCCCCTG GGTGTGACCC 
GAAGTACAGG GCTTCCGCAT CCCTAAGGGA 
GTCATCGGTG CTGAAGGATG AGGCCGTCTG 
ACCCCGAACA CTTCCTGGAT GCCCAGGGCC 
TTCCTGCCTT TCTCAGCAGG CCGCCGTGCA 
CCGCATGGAG CTCTTCCTCT TCTTCACCTC 
TCTCGGTGCC CACTGGACAG CCCCGGCCCA 
TTCCTGGTGA CCCCATCCCC CTATGAGCTT 
TGGGGTACCT AGTCCCCAGC CTGCTCCCTA 
ATAAAGCAAT GTGGTAGTTC CAAAAAAAAA 
AAAAAAAAAA AAAAAAA 



FEATURES : 

5'UTR: 1-77 

Start Codon: 78 

Stop Codon: 1416 

3'UTR: 1419 



Homologous proteins : 

Top 10 BLAST Hits 



CRA| 
CRAI 
CRA| 
CRA| 
CRA| 
CRAI 
CRAI 
CRAI 
CRAI 
CRAI 



18000004889269 
18000004927597 
18000004923926 
18000005007118 
18000005100319 
18000004884804 
18000004889271 
18000004884803 
18000004939934 
18000005107537 



/altid=gi 
/altid=gi 
/altid=gi 
/altid=gi 
/altid=gi 
/altid=gi 
/altid=gi 
/altid-gi 
/altid=gi 
/altid=gi 



181304 /def=gb [AAA53500.il (M33388. 
4503223 /def =ref I NP_000097 . 1 | cyto. 
181306 /def=gb|AAA35737.1| (M33189. 
2493367 /def =sp | Q294 88 | CPDHJ4ACFA . 
3913340 /def=sp|018992|CPDJ_CALJA . 
486997 /def=pir| IS37284 cytochrome. 
522195 /def=gb|AAA36403.1| (M24499. 
461826 /def=sp|Q01361|CPDE_BOVIN C. 
117244 /def=spi P13108 |CPD4_RAT CYT . 
2575863 /def=dbj I BAA23125 . 1 1 (AB00. 
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0 


0 
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0 


0 
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0 


0 



EST: 

Score E 

Sequences producing significant alignments: (bits) Value 

gil 9872134 /dataset=dbest /taxon=960 . . . 775 0.0 

gi I 6144331 /da taset=dbest /taxon=9606 ... 648 0.0 

gil 6703894 /da taset=dbest /taxon=9606 ... 648 0.0 



EXPRESSION INFORMATION FOR MODULATORY USE: 

gi|9872134 /liver 
gi I 6144331 /kidney 
gil 6703894 /lung 



Tissue Expression: 
Whole Liver 
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1 MGLEALVPLA VIVAIFLLLV DLMHRRQRWA 

51 FQNTPYCFDQ LRRRFGDVFS LQLAWTPVVV 

101 RPPVPITQIL GFGPRSQGRP FRPNGLLDKA 

151 FLRLLDLAQE GLKEESGFLR EVLNAVPVLL 

201 DELLTEHRMT WDPAQPPRDL TEAFLAEMEK 

251 LFSAGMVTTS TTLAWGLLLM ILHPDVQRRV 

301 HMPYTTAVIH EVQRFGDIVP LGVTHMTSRD 

351 VLKDEAVWEK PFRFHPEHFL DAQGHFVKPE 

401 ELFLFFTSLL QHFSFSVPTG QPRPSHHGVF 
(SEQ ID NO: 2) 

FEATURES : 

Functional domains and key regions: 

[1] PDOC00001 PS00001 ASN_GLYCOSYLATION 
N-glycosylation site 

347-350 NLSS 



[2] PDOC00005 PS00005 PKC_PHOSPHO_SITE 
Protein kinase C phosphorylation site 

327-329 TSR 



[3] PDOC00006 PS00006 CK2_PHOSPHO_SITE 
Casein kinase II phosphorylation site 

Number of matches: 5 

1 93-96 THGE 

2 198-201 TQLD 

3 238-241 SFND 

4 327-330 TSRD 

5 437-440 SPYE 



[4] PDOC00008 PS00008 MYRISTYL 
N-myristoylation site 

Number of matches: 2 

1 233-238 GNPESS 

2 255-260 GMVTTS 



[5] PDOC00009 PS00009 AM I DAT I ON 
Amidation site 

Number of matches: 2 

1 140-143 CGRR 

2 387-390 AGRR 



[6] PDOC00081 PS00086 CYTOCHROME_P4 50 
Cytochrome P4 50 cysteine heme-iron ligand signature 

385-394 FSAGRRACLG 

Membrane spanning structure and domains: 



Helix 


Begin 


End 


Score 


Certainity 


1 


3 


23 


1.877 


Certain 


2 


68 


88 


1.096 


Certain 


3 


171 


191 


0.668 


Putative 


4 


252 


272 


1.914 


Certain 


5 


400 


420 


1.402 


Certain 


6 


425 


445 


0.833 


Putative 



ARYSPGPLPL 
LNGLAAVREA 
VSNVIASLTC 
HI PALAGKVL 
AKGNPESSFN 
QQEIDDVIGQ 
IEVQGFRIPK 
AFLPFSAGRR 
AFLVTPSPYE 



PGLGNLLHVD 
LVTHGEDTAD 
GRRFEYDDPR 
RFQKAFLTQL 
DENLRI WAD 
VRRPEMGDQA 
GTTLITNLSS 
ACLGEPLARM 
LCAVPR 
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BLAST Alignment to Top Hit: 

>CRA I 18000004889269 /alt id=gi I 181304 /def =gb I AAA53500 . 1 I (M33388) 
cytochrome P450 IID6 [Homo sapiens] /org^Homo sapiens 
/taxon=9606 /dataset=nraa /length=497 
Length =4 97 

Score = 884 bits (2259), Expect = 0.0 

Identities - 444/497 (89%), Positives = 445/497 (89%), Gaps = 51/497 (10%) 



Que r y : 
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Sb j ct : 
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Que r y : 


61 


Sb j c t : 
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119 


Sb j ct * 


121 
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i in 

1 Jv 




10 1 


Que r y : 


i on 


Sbjct : 


241 
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250 


Sbjct : 


301 


Query: 


310 


Sbjct: 


361 
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370 


Sbjct: 


421 
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430 


Sbjct: 


481 



MGLEALVPLAVIVAIFLLLVDLMHRRQRWAARYSPGPLPLPGLGNLLHVDFQNTPYCFDQ 60 
MGLEALVPLAVIVAIFLLLVDLMHRRQRWAARY PGPLPLPGLGNLLHVDFQNTPYCFDQ 
MGLEALVPLAVIVAIFLLLVDLMHRRQRWAARYPPGPLPLPGLGNLLHVDFQNTPYCFDQ 60 



LRRRFGDVFSLQLAWTPVVVLNGLAAVREALVTHGEDTADRPPVPITQILGFGPRSQG 



-RPFRPNGLLDK 129 
RPFRPNGLLDK 



AVSNVIASLTCGRRFEYDDPRFLRLLDLAQEGLKEESGFLREVLNAVPVLLHIPALAGKV 



LRFQKAFLTQLDELLTEHRMTWDPAQPPRDLTEAFLAEMEKAKGNPESSFNDENLRI VVA 



DLFSAGMVTTSTTLAWGLLLMILHPDVQRRVQQEIDDVIGQVRRPEMGDQAHMPYTTAVI 



HEVQRFGDIVPLGVTHMTSRDIEVQGFRIPKGTTLITNLSSVLKDEAVWEKPFRFHPEHF 



LDAQGHFVKPEAFLPFSAGRRACLGEPLARMELFLFFTSLLQHFSFSVPTGQPRPSHHGV 



FAFLV+PSPYELCAVPR 



Hmmer search results (Pfam) : 

Scores for sequence family classification (score includes all domains) : 
Model Description Score E-value N 



PF00067 Cytochrome P450 516.7 1.7e-151 2 



Parsed for domains: 

Model Domain seq-f seq-t hmm-f hmm-t score E-value 

PF00067 1/2 35 113 1 92 [. 78.1 2.7e-21 

PF00067 2/2 117 443 .. 150 497 .] 442.7 3.3e-129 
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1 AGCCTTACAA AGTGCTGGGA TTACCTGCGT GAGCCACCGG GTCCGGCCTC 
51 TTTATGTCTT ACTGTACTGT CTGTCTTGAA AAGTACTTAT TATTTTTGAT 
101 TGGTTCATCA TTTAGTCTAA TTAAAATAAG AGTAGTTTAC ACACCACAAT 
151 TACAGTATTA TAATACTCTG TTTTTCTGTG TGCTTACTAT TACCAGTGAG 
201 TTTTGTACCT TTAGATGATT TCTTCTTGCT CATTAATATC CTTTTTTTTT 
251 TCAGATTGAA AAACTCCCTT TAGCATTTCT TGTGGGATAT AGGTCTGGTG 
301 TTGATGAAAT CTCGCAGCTT TTGTTTGTCT GGGAAGGTCT TTATTTCTCC 
351 TTCCTGTTGG AAGGATATTT TTGCCAGATA CGTTATTCTA GGCTAAAAGT 
4 01 TTTTTTTCCT TCAGCACTTT AAATATGTCA TGCCACTCCC CCCTGGCCTG 
4 51 TAAGGTTTCC ACTGGAAAGG TGGCTGCCCC ATGTCATGTA TTGGAGCTCT 
501 ACTGCATGTT ATTTGTTTCT TTTCTCTTGC TGCTTTTAGG ATCCTTTCTT 
551 TATCCTTGAC CTTTCGGAGT TTAATTATCA GATGCCTTGA GGTCGTCTTC 
601 TTTGGGTTAA ATCTGCTTGG TGTTCTATAA ACTTCTTGTA CAAAAAATCA 
651 GCCAGGCATG GTGGTGGGCA CCTGTAATCC CAGCTACTTG GGAGGCTGAG 
701 GCAGGAGAAT CGCTTGAACC CTGGAGGTGG AGGTTGCAGT GAGCCGAGAT 
751 CGCATCATTG CACTCCCACC TGGGCGACAG AGCAAAACTC CGTCTCAAAA 
801 AAAAAATTAT TTGGGCTCGG TGGTGCCTGT AGTCCCAGCT ACTTGGGAGG 
851 CAGGAGGTCC ACTTGATGTT GAGATTGCAG TGAGCCATGA TCCTGCCACT 
901 GCACTCCGGC CCGGGCAACA GAGTGAGACC CTGTCTAAAG AAAAAATAAA 
951 AATAAAAAAG CAACATATCC TAAATAAAGG ATCCTCCATA ATGTTTCCAC 
1001 CAGATTTCTA ATCAGAAACA TGGAGGCCAG GAAGCAGTGG AGAATGACGA 
1051 CCCTCAGGCA GCCCTGGAGG ATGCTGTCAC AGGCTGGGGC AAGGGCCTTC 
1101 AGGCTACCAA CTGGGAGCTC TGGGAACAGC CCTGTTGCAA ACAGGAAGTC 
1151 ATGGCCCGGC CAGAGCCCAG AATGTGGGCT GAGCTGGGAT CCATGTGACA 
1201 GCTTTGAGGC TCACCGGGAG CAGCCTCTGG ACAGGAGAGG TCCCATCCAG 
1251 GAAACCTCGG GCATGGCTGG GAAGTGGGGT ACTTGGTGCC GGGTCTGTAT 
1301 GTGTGTGTGA CTGGTGTGTG TGAGAGAGAA TGTGTGCCCT GAGTGTCAGT 
1351 GTGAGTCTGT GTATGTGTGA ATATTGTCTT TGTGTGGGTG ATTTTCTGCA 
14 01 TGTGTAATCG TGTCCCTGCA AGTGTGAACA AGTGGACAAG TGTCTGGGAG 
14 51 TGGACAAGAG ATCTGTGCAC CATCAGGTGT GTGCATAGCG TCTGTGCATG 
1501 TCAAGAGTGC AAGGTGAAGT GAAGGGACCA GGCCCATGAT GCCACTCATC 
1551 ATCAGGAGCT CTAAGGCCCC AGGTAAGTGC CAGTGACAGA TAAGGGTGCT 
1601 GAAGGTCACT CTGGAGTGGG CAGGTGGGGG TAGGGAAAGG GCAAGGTCAT 
1651 GTTCTGGAGG AGGGGTTGTG ACTACATTAG GGTGTATGAG CCTAGCTGGG 
1701 AGGTGGATGG CCGGGTCCAC TGAGACCCTG GTTATCCCAG AAGCCTGTGT 
1751 GGGCTTGGGG AGCTTGGAGT GGGGAGAGGG GGTGACTTCT CCGACCAGGC 
1801 CTTTCTACCA CCCTACCCTG GGTAAGGGCC TGGAGCAGGA AGCAGCGGCA 
1851 AGGACCTCTG GAGCAGCCCA TACCTGCCCT GGCCTGACTC TGCCACTGGC 
1901 AGCACAGTCA ACACAGCAGG TTCACTCACA GCAGAGGGCG AAGGCCATCA 
1951 TCAGCTCCCT TTATAAGGGA AGGGTCACGC GCTCGGTGTG CCGAGAGTGT 
2001 CCTGCCTGGT CCTCTGTGCC TGGTGGGGTG GGGGTGCCAG GTGTGTCCAG 
2051 AGGAGCCCAG TTGGTAGTGA GGCAGCCATG GGGCTAGAAG CACTGGTGCC 
2101 CCTGGCCATG ATAGTGGCCA TCTTCCTGCT CCTGGTGGAC CTGATGCACC 
2151 GGCACCAACG CTGGGCTGCA CGCTACCCGC CAGGTCCCCT GCCACTGCCC 
2201 GGGCTGGGCA ACCTTGCTGC ATGTGGACTT CCAGAACACA CCATACTGCT 
2251 TCGACCAGGT GAGGGAGGAG GTCCTGGAGG GCGGCAGAGG TCCTGAGGAT 
2301 GCCCCACCAC CAGCAAACAT GGGTGGTGGG TTAAACCACA GGCTGGATCA 
2351 GAAGCCAGGC TGAGAAGGGG AAGCAGGTTT GGGGGACGTT CCTGGGGAAG 
24 01 GACATTTATA CATGGCATGA AGGACTGGAT TTTCCAAAGG CCAAGGAAGA 
24 51 GTAGGGCAAG GGCCTGGAGG TGGAGCTGGA CTTGGCAGTG GGCATGCAAG 
2501 CCCATTGGGC AACATATGTT ATGGAGTACA AAGTCCCTTC TGCTGACACC 
2551 AGAAGGAAAG GCCTTGGGAA TGGAAGATGA GTTAGTCCTG AGTGCCGTTT 
2601 AAATCACGAA ATCGAGGATG AAGGGGGTGC AGTGACCCGG TTCAAACCTT 
2651 TTGCACTGTG GGTCCTCGGG CCTCACTGCT CACCGGCATG GACCATCATC 
27 01 TGGGAATGGG ATGCTAACTG GGGCCTCTCG GCAATTTTGG TGACTCTTGC 
27 51 AAGGTCATAC CTGGGTGACG CATCCAAACT GAGTTCCTCC ATCACAGAAG 
2801 GTGTGACCCC CACCCCTGCC CCACGATCAG GAGGCTGGGT CTCCTCCTTC 
2851 CACCTGCTCA CTCCTGGTAG CCCCGGGGGT CGTCCAAGGT TCAAATAGGA 
2901 CTAGGACCTG TAGTCTGGGG TGATCCTGGC TTGACAAGAG GCCCTGACCC 
2951 TCCCTCTGCA GTTGCGGCGC CGCTTCGGGG ACGTGTTCAG CCTGCAGCTG 
3001 GCCTGGACGC CGGTGGTCGT GCTCAATGGG CTGGCGGCCG TGCGCGAGGC 
3051 GATGGTGACC CGCGGCGAGG ACACGGCCGA CCGCCCGCCT GCGCCCATCT 
3101 ACCAGGTCCT GGGCTTCGGG CCGCGTTCCC AAGGCAAGCG GCGGTGGGGG 
3151 ACAGAGACCG CGTTTCCGTG GGCCCCGGGT GGACAGTGAC CGTAGCCCAA 
3201 GCAGCGCCGA CAGGGCGTGG GGTCCTGGAC GTGAAACAGA GATAAAGGCC 
3251 AGCGAGTGGG CTGAGGACAG TGGGCCAGGA AACCACCTGC ACGGGGGAGG 
3301 TGCGAGTCTG TGGGCTGGGA GGGGGCGGGG CTACTGCCCA GACCCGCCAG 
3351 AAGCCCGGTG GGCGAGGCTG ATGCGTCGAA GTGGCGGTGG CGGGGACCGC 
34 01 GCCTATGCTG CGGGCTCAGT GTGGGCGGGA CGGGCGGGAT CTTCCTTGAG 
34 51 TGGAAAGGTG GTCAGGGTGG GCAGAGACGA GGTGGGGCCA AACCCCGCCC 
3501 CAGGCAGGGG AGCAATGTGG GTGAGCAAAG AGTGGGCCCT GTGCCCAGCT 
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3551 GGACCGGGCT AGGGACTGCG GGAGACCTTG TGGAGCGCCA GGGTTGGAGT 

3601 GGGTGGCGGA GGGTGGGGCC AAGGCCTTCA TGGCAACGCC CACGTGTCCG 

3651 TCCCGCCCCC AGGGGTGATC CTGTCGCGCT ATGGGCCCGC GTGGCGCGAG 

3701 CAGAGGCGCT TCTCCGTGTC CACCTTGCGC AACTTGGGCC TGGGCAAGAA 

37 51 GTCGCTGGAG CAGTGGGTGA CCGAGGAGGC CGCCTGCCTT TGTGCCGCTT 

3801 CGCCGACCAA GCCGGTGGGT GATGGGCAGA AGGGCACAAA GCGGGAACTG 

3851 GGAAGGCGGG GGACGGAGAA GGCAACCCCT TACCCGCATC TCCCCACCCC 

3901 CAGGACGCCC CTTTCGCCCC AACGGCCTCT TGGACAAAGC CGTGAGCAAC 

3951 GTGATCGCCT CCCTCACCTG CGGGCGCCGC TTCGAGTACG ACGACCCTCG 

4 001 CTTCCTCAGG CTGCTGGACC TAGCTCAGGA GGGACTGAAG GAGGAGTCGG 

4 051 GCTTTCTGCG CGAGGTGCGG AGCGAGAGAC CGAGGAGTCT CTGCAGGGCG 

4101 AGCTCCTGAG AGGTGCCGGG GCTGGACTGG GGCCTCCGAA GGGCAGGATT 

4151 TGCATAGATG GGTTTGGGAA AGGACATTCC AGGAGACCCC ACTGTAAGAA 

4 201 GGGCCTGGAG GAGGAGGGGA CATCTCAGAC ATGGTCGTGG GAGAGGTGTG 

4 251 CCCGGGTCAG GGGGCACCAG GAGAGGCCAA GGACTCTGTA CCCCCGTCCA 

4 301 CGTTGGAGAT TTCGATTTTA GGTTTCTCCT CTGGGCAAGG AGAGAGGGTG 

4 351 GAGGCTGGCA CTTGGGGAGG GACTTGGTGA GGTCAGTGGT AAGGACAGGC 

4 4 01 AGGCCCTGGG TCTACCTGGA GATGGCTGGG GCCTGAGACT TGTCCAGGTG 

4 4 51 AACGCAGAGC ACAGGAGGGA TTGAGACCCC GTTCTGTCTG GTGTAGGTGC 

4 501 TGAATGCTGT CCCCGTCCTC CTGCACATCC CAGCGCTGGC TGGCAAGGTC 

4 551 CTACGCTTCC AAAAGGCTTT CCTGACCCAG CTGGATGAGC TGCTAACTGA 

4 601 GCACAGGATG ACCTGGGACC CAGCCCAGCC ACCCCGAGAC CTGACTGAGG 

4 651 CCTTCCTGGC AAAGAAGGAG AAGGTGAGAG TGGCTGCCAC GGTGGGGGGC 

4 701 AAGGGTGGTG GGTTGAACGT CCCAGGAGGA ATGAGGGGAG GCTGGGCAAA 

4 751 AGGTTGGACC AGTGCATCAC CCGGCGAGCC GCATCTGGGC TGACAGGTGC 

4 801 AGAATTGGAG GTCATTTGGG GGCTACCCCG TTCTATCCCC TGAGTATCCT 

4 851 CTCGGCCCTG CTCAGGCCAA GGGGAGCCCT GAGAGCAGCT TCAATGATGA 

4 901 GAACCTGCGC ATAGTGGTGG GTAACCTGTT CCTTGCCGGG ATGGTGACCA 

4 951 CCTCGACCAC GCTGGCCTGG GGCCTCCTGC TCATGATCCT ACACCTGGAT 

5001 GTGCAGCGTG AGCCCAGCTG GGGCCCAAGG CAGGGACTGA GGGAGGAAGG 

5051 GTACAGCTGG GGGCCCCTGG GCTTAGCTGG GACACCCGGG GCTTCCAGCA 

5101 CAGGCGTGGC CAGGCTCCTG TAAGCCTAAC TTCCTCCAAC ACAGGAGGAA 

5151 GGAGAGTGTC CCCTGGGTGC TGACCCATTG TGGGGACGCA TGTCTGTCCA 

5201 GTCCGTGTCC AACAGGAGAT CGACGACGTG ATAGGGCAGG TGCGGCGACC 

5251 AGAGATGGGT GACCAGGCTC ACATGCCCTA CACCACTGCC GTGATTCACG 

5301 AGGTGCAGCG CTTTGGGGAC ATCATCCCCC TGAGTGTGAC CCATATGACA 

5351 TCCCGTGACA TCGAAGTACA GGGCTTCCGC ATCCCTAAGG TAGGCCTGGC 

54 01 GCCCTCCTCA CCCCAGCTCA GCACCAGCAC CTGGTGATAG CCCCAGCATG 

54 51 GCTACTGCCA GGTGGGCCCA CTCTAGGAAC CCTGGCCACC TAGTCCTCAA 

5501 TGCCACCACA CTGACTGTCC CCACTTGGGT GGGGGGTCCA GAGTATAGGC 

5551 AGGGCTGGCC TGTCCATCCA GAGCCCCCGT CTAGTGGGGA GACAAACCAG 

5601 GACCTGCCAG AATGTTGGAG GACCCAGCGC CTGCAGGGAG AGGGGGCAGT 

5651 GTGGGTGCCT CTGAGAGGTG TGACTGCGCC CTGCTGTGGG GTCGGAGAGG 

5701 GTACTGTGGA GCTTCTCGGG CGCAGGACTA GTTGACAGAG TCCAGCTGTG 

5751 TGCCAGGCAG TGTGTGTCCC CCGTGTGTTT GGTGGCAGGG GTCCCAGCAT 

5801 CCTAGAGTCC AGTCCCCACT CTCACCCTGC ATCTCCTGCC CAGGGAACGA 

5851 CACTCATCAC CAACCTGTCA TCGGTGCTGA AGGATGAGGC CGTCTGGGAG 

5901 AAGCCCTTCC GCTTCCACCC CGAACACTTC CTGGATGCCC AGGGCCACTT 

5951 TGTGAAGCCG GAGGCCTTCC TGCCTTTCTC AGCAGGTGCC TGTGGGGAGC 

6001 CCGGCTCCCT GTCCCCTTCC GTGGAGTCTT GCAGGGGTAT CACCCAGGAG 

6051 CCAGGCTCAC TGACGCCCCT CCCCTCCCCA CAGGCCGCCG TGCATGCCTC 

6101 GGGGAGCCCC TGGCCCGCAT GGAGCTCTTC CTCTTCTTCA CCTCCCTGCT 

6151 GCAGCACTTC AGCTTCTCCG TGGCCGCCGG ACAGCCCCGG CCCAGCCACT 

6201 CTCGTGTCGT CAGCTTTCTG GTGACCCCAT CCCCCTACGA GCTTTGTGCT 

6251 GTGCCCCGCT AGAATGGGGT ACCTAGTCCC CAGCCTGCTC CCTAGCCAGA 

6301 GGCTCTAATG TACAATAAAG CAATGTGGTA GTTCCAACTT GGGTCCCCTG 

6351 CTCACGCCCT CGTTGGGATC ATCCTCCTCA GGGCAACCCC ACCCCTGCCT 

64 01 CATTCCTGCT TACCCCACCG CCTGGCCGCA TTTGAGACGG GTACGTTGAG 

64 51 GCTGAGCAGA TGTCAGTTAC CCTTGCCCAT AATCCCATGT CCCCCACTGA 

6501 CCCAACTCTG ACTGCCCAGA TTGGTGACAA GGACTACATT GTCCTGGCAT 

6551 GTGGGGAAGG GGCCAGAATG GGCTGACTAG AGGTGTCAGT CAGCCCTGGA 

6601 TGTGGTGGAG AGGGCAGGAC TCAGCCTGGA GGCCCATATT TCAGGCCTAA 

6651 CTCAGCCCAC CCCACATCAG GGACAGCAGT CCTGCCAGCA CCATCACAAC 

6701 AGTCACCTCC CTTCATATAT GACACCCCAA AATGGAAGAC AAATCATGTC 

6751 AGGGAGCTAT ATGCCAGGGC TACCTCCCAG GGCTCAGTCG GCAGGTGCCA 

6801 GAACATTCCC TGGGAAGGCC CCAGGAAAAC CCAGGACCGA GCCACCGCCC 

6851 TCAGCCTGTC ACCTTGTGTC CAAAATTGGT GGGTTCTTGG TCTCACTGAC 

6901 TTCAAGAATG AAGCCGTGGA CCCTCACGGT GAGTGTTACA GTTCTTAAAG 

6951 ATGGTGTGTT CAGAGTTTGT TCCTTCTGAT GTTAAGACGT GTTCAGAGTT 

7 001 TCTTCCTTCT GGTGGGTGCG TGGTCTTGCT GGCTTCAGGA GTGAAGCTGC 

7051 AGACCTTCAC AGTGAGTGTT ACGGCTCTTA AGGCTGCACG TACGGAGTTG 




Replacement Sheet 
09/820,788 



7101 TTCATTCTTC CTGGTGGGTT TGTGGTCTCA CTGGCCTCAG GAGTGAAACT 

7151 GCAGTCCTTC CAGTGTTACA ACTCATAAAG GCAGTGTGGA CCCAATGAGG 

7201 GAGCAGCAGC AGCAAGACTT ACTGCAAACA GCAAAAGAAT GATGGCAACC 

7251 AGGTTGCCGC TGCTACTTCA GGCAGCCTGC TTTTATTCCC TTATCTGACC 

7301 CCCACCCACA TCCTGCTGAT TGGCCCATTT TACAGACAGT GGATTGGTCC 

7351 ACTTACAGAG AGCTGATTGG TGCATTTACA ATCCCTGAGC TAGACACAGA 

74 01 GTACTGATTG GTATATTTAC AAACCTTGAG CTAGACACAG AGTGCTGAAT 

74 51 GGTGTATTTA CAATCCCTTA GCTAGACATA AAGGTTGTCC CAGTCCCCAC 

7501 TAGATTAGCT AGATAGAGTA GACAGAGAGC ACTGATTGGT GCGTTTACAA 

7551 ACCTTGAGTT AGACACAGGG TGCTGACTGG TGTGTTTACA AACCTTGAGC 

7 601 TAGACACAGA GTGCTGATTG GTGTATTTAC AATCTTTTAG CTAGAAATAA 

7 651 AGGTTCCCCA AGTCCCCACC AGATTAGCTA GATAGAGTGC TAATTGGTGC 

7701 ATGCACGAAC CCGGAGCTAG ACACAGAGTG CTGATTGGTG CATATACAAT 

7751 CCTCTGGCTA GACATAAAAG TTCTCCAAGT CCCCACCTGA CTCAGGAGCC 

7801 CAGCCAGCTT CGCCTAGTGG ATCCTATGCC AGGGCCACAG GCAGAGCTGC 

7851 CTGCTAGTCC CACACCGGGC ACCTGTACTC CTCAGCCCTT GGGCAGTGGA 

7901 CGGGACCAGG TGCCGTGGAG CAGTGGGAGG CACCCATCCG GGAGGCTCGG 

7 951 GCCTCGCAGG GAGCCCACCG TAGGGAGGCT TGGGCATGGC AGGCTGCAAG 

8001 TCCTGAGCCC TGCCCCGCGG GGAGGTGACT GAGGCCTGGC GACAATTCAA 

8051 GTGTGGTGAG CGCCGGCAGG CCAGCAGTAC TGGGGGACCC GGTGCCCCCT 

8101 CTGCAGCTGC TGGCCCAGGT GCTAAGCCCC TCACTGCCTG GGGCCAGAGG 

8151 CACCAGCCGG CCGCTCCGAG TGCAGGGCCC GCTGAGCCCC TGCCCACCCA 

8201 GAACTGGTGC TGGCCCGCGA GCAACCCAGG TTCCCGCACA CGCCTCTCCC 

82 51 TCCATACCTC CCCGCAAGCA GACGGAGCCG GCTCCAGCCT CCACCAGTCC 

8301 AGAGAGGGGC TCCCACAGTG CAGCGCTGGG CTGAACAAGG TCCTACGCTT 

8351 CCAAAAGGCT TTCCTGACCC AGCTGGATGA GCTGCTAACT GAGCACAGGA 

84 01 TGACCTGGGA CCCAGCCCAG CCCCCCCGAG ACCTGACTGA GGCCTTTCCT 

84 51 GGCAGAGATG GAGAAGGTGA GAGTGGCTGC CACGGTGGGG GGCAAGGGTG 

8501 GTGGGTTGAG CGTCCCAGGA GGAATGAGGG GAGGCTGGGC AAAAGGTTGG 

8551 ACCAGTGCAT CACCCGGCGA GCCGCATCTG GGCTGACAGG TGCAGAATTG 

8601 GAGGTCATTT GGGGGCTACC CCGTTCTGTC CCGAGTATGC TCTCGGCCCT 

8651 GCTCAGGCCA AGGGGAACCC TGAGAGCAGC TTCAATGATG AGAACCTGCG 

8701 CATAGTGGTG GCTGACCTGT TCTCTGCCGG GATGGTGACC ACCTCGACCA 

8751 CGCTGGCCTG GGGCCTCCTG CTCATGATCC TACATCCGGA TGTGCAGCGT 

8801 GAGCCCATCT GGGAAACAGT GCAGGGGCCG AGGGAGGAAG GGTACAGGCG 

88 51 GGGGCCCATG AACTTTGCTG GGACACCCGG GGCTCCAAGC ACAGGCTTGA 

8901 CCAGGATCCT GTAAGCCTGA CCTCCTCCAA CATAGGAGGC AAGAAGGAGT 

8951 GTCAGGGCCG GACCCCCTGG GTGCTGACCC ATTGTGGGGA CGCATGTCTG 

9001 TCCAGGCCGT GTCCAACAGG AGATCGACGA CGTGATAGGG CAGGTGCGGC 

9051 GACCAGAGAT GGGTGACCAG GCTCACATGC CCTACACCAC TGCCGTGATT 

9101 CATGAGGTGC AGCGCTTTGG GGACATCGTC CCCCTGGGTG TGACCCATAT 

9151 GACATCCCGT GACATTCGAA GTACAGGGCT TCCGCATCCC TAAGGTAGGC 

9201 CTGGCGCCNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9301 NNNNNNNCCT GCCCAGGGAA CGACACTCAT CACCAACCTG TCATCGGTGC 

9351 TGAAGGATGA GGCCGTCTGG GAGAAGCCCT TCCGCTTCCA CCCCGAACAC 

94 01 TTCCTGGATG CCCAGGGCCA CTTTGTGAAG CCGGAGGCCT TCCTGCCTTT 

94 51 CTCAGCAGGT GCCTGTGGGG AGCCCGGCTC CCTGTCCCCT TCCGTGGAGT 

9501 CTTGCAGGGG TATCACCCAG GAGCCAGGCT CACTGACGCC CCTCCCCTCC 

9551 CCACAGGCCG CCGTGCATGC CTCGGGGAGC CCCTGGCCCG CATGGAGCTC 

9601 TTCCTCTTCT TCACCTCCCT GCTGCAGCAC TTCAGCTTCT CGGTGCCCAC 

9651 TGGACAGCCC CGGCCCAGCC ACCATGGTGT CTTTGCTTTC CTGGTGAGCC 

97 01 CATCCCCCTA TGAGCTTTGT GCTGTGCCCC GCTAGAATGG GGTACCTAGT 

9751 CCCCAGCCTG CTCCCTAGCC AGAGGCTCTA ATGTACAATA AAGCAATGTG 

9801 GTAGTTCCAA CTCGGGTCCC CTGCTCACGC CCTCGTTGGG ATCATCCTCC 

9851 TCAGGGCAAC CCCACCCCTG CCTCATTCCT GCTTACCCCA CCGCCTGGCC 

9901 GCATTTGAGA CAGGGGTACG TTGAGGCTGA GCAGATGTCA GTTACCCTTG 

9951 CCCATAATCC CATGTCCCCC ACTGACCCAA CTCTGACTGC CCAGATTGGT 

10001 GACAAGGACT ACATTGTCCT GGCATGTGGG GAAGGGGCCA GAATGGGCTG 

10051 ACTAGAGGTG TCAGTCAGCC CTGGATGTGG TGGAGAGGGC AGGACTCAGC 

10101 CTGGAGGCCC ATATTTCAGG CCTAACTCAG CCCACCCCAC ATCAGGGACA 

10151 GCAGTCCTGC CAGCACCATC ACAACAGTCA CCTCCCTTCA TATATGACAC 

10201 CCCAAAACGG AAGACAAATC ATGGCGTCAG GGAGCTATAT GCCAGGGCTA 

10251 CCTACCTCCC AGGGCTCAGT CGGCAGGT 
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Intron : 
Exon: 
Stop 
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Position 

3101 
3439 
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DNA 

Position 
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Protein 

Major Minor Domain Position Major Minor 

C T A Exon 107 T T T 

A G Intron 

C T Exon 24 5 P L 

G A Intron 

T C Intron 

C T Intron 

G A Intron 

C T Intron 



GTGTGACCCCCACCCCTGCCCCACGATCAGGAGGCTGGGTCTCCTCCTTCCACCTGCTCA 
CTCCTGGTAGCCCCGGGGGTCGTCCAAGGTTCAAATAGGACTAGGACCTGTAGTCTGGGG 
TGATCCTGGCTTGACAAGAGGCCCTGACCCTCCCTCTGCAGTTGCGGCGCCGCTTCGGGG 
ACGTGTTCAGCCTGCAGCTGGCCTGGACGCCGGTGGTCGTGCTCAATGGGCTGGCGGCCG 
TGCGCGAGGCGATGGTGACCCGCGGCGAGGACACGGCCGACCGCCCGCCTGCGCCCATCT 
[C,T,A] 

CCAGGTCCTGGGCTTCGGGCCGCGTTCCCAAGGCAAGCGGCGGTGGGGGACAGAGACCGC 
GTTTCCGTGGGCCCCGGGTGGACAGTGACCGTAGCCCAAGCAGCGCCGACAGGGCGTGGG 
GTCCTGGACGTGAAACAGAGATAAAGGCCAGCGAGTGGGCTGAGGACAGTGGGCCAGGAA 
ACCACCTGCACGGGGGAGGTGCGAGTCTGTGGGCTGGGAGGGGGCGGGGCTACTGCCCAG 
ACCCGCCAGAAGCCCGGTGGGCGAGGCTGATGCGTCGAAGTGGCGGTGGCGGGGACCGCG 
<nt 2801-3401 of SEQ ID NO: 3) 

CGGCGGTGGGGGACAGAGACCGCGTTTCCGTGGGCCCCGGGTGGACAGTGACCGTAGCCC 
AAGCAGCGCCGACAGGGCGTGGGGTCCTGGACGTGAAACAGAGATAAAGGCCAGCGAGTG 
GGCTGAGGACAGTGGGCCAGGAAACCACCTGCACGGGGGAGGTGCGAGTCTGTGGGCTGG 
GAGGGGGCGGGGCTACTGCCCAGACCCGCCAGAAGCCCGGTGGGCGAGGCTGATGCGTCG 
AAGTGGCGGTGGCGGGGACCGCGCCTATGCTGCGGGCTCAGTGTGGGCGGGACGGGCGGG 
[A,G] 

TCTTCCTTGAGTGGAAAGGTGGTCAGGGTGGGCAGAGACGAGGTGGGGCCAAACCCCGCC 
CCAGGCAGGGGAGCAATGTGGGTGAGCAAAGAGTGGGCCCTGTGCCCAGCTGGACCGGGC 
TAGGGACTGCGGGAGACCTTGTGGAGCGCCAGGGTTGGAGTGGGTGGCGGAGGGTGGGGC 
CAAGGCCTTCATGGCAACGCCCACGTGTCCGTCCCGCCCCCAGGGGTGATCCTGTCGCGC 
TATGGGCCCGCGTGGCGCGAGCAGAGGCGCTTCTCCGTGTCCACCTTGCGCAACTTGGGC 
(nt 3139-3739 of SEQ ID NO: 3) 

ATGACCTGGGACCCAGCCCAGCCACCCCGAGACCTGACTGAGGCCTTCCTGGCAAAGAAG 
GAGAAGGTGAGAGTGGCTGCCACGGTGGGGGGCAAGGGTGGTGGGTTGAACGTCCCAGGA 
GGAATGAGGGGAGGCTGGGCAAAAGGTTGGACCAGTGCATCACCCGGCGAGCCGCATCTG 
GGCTGACAGGTGCAGAATTGGAGGTCATTTGGGGGCTACCCCGTTCTATCCCCTGAGTAT 
CCTCTCGGCCCTGCTCAGGCCAAGGGGAGCCCTGAGAGCAGCTTCAATGATGAGAACCTG 
[C,T] 

GCATAGTGGTGGGTAACCTGTTCCTTGCCGGGATGGTGACCACCTCGACCACGCTGGCCT 
GGGGCCTCCTGCTCATGATCCTACACCTGGATGTGCAGCGTGAGCCCAGCTGGGGCCCAA 
GGCAGGGACTGAGGGAGGAAGGGTACAGCTGGGGGCCCCTGGGCTTAGCTGGGACACCCG 
GGGCTTCCAGCACAGGCGTGGCCAGGCTCCTGTAAGCCTAACTTCCTCCAACACAGGAGG 
AAGGAGAGTGTCCCCTGGGTGCTGACCCATTGTGGGGACGCATGTCTGTCCAGTCCGTGT 
(nt 4608-5208 of SEQ ID NO: 3) 
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5627 CCCCTGAGTGTGACCCATATGACATCCCGTGACATCGAAGTACAGGGCTTCCGCATCCCT 
AAGGTAGGCCTGGCGCCCTCCTCACCCCAGCTCAGCACCAGCACCTGGTGATAGCCCCAG 
CATGGCTACTGCCAGGTGGGCCCACTCTAGGAACCCTGGCCACCTAGTCCTCAATGCCAC 
CACACTGACTGTCCCCACTTGGGTGGGGGGTCCAGAGTATAGGCAGGGCTGGCCTGTCCA 
TCCAGAGCCCCCGTCTAGTGGGGAGACAAACCAGGACCTGCCAGAATGTTGGAGGACCCA 
[G,A] 

CGCCTGCAGGGAGAGGGGGCAGTGTGGGTGCCTCTGAGAGGTGTGACTGCGCCCTGCTGT 
GGGGTCGGAGAGGGTACTGTGGAGCTTCTCGGGCGCAGGACTAGTTGACAGAGTCCAGCT 
GTGTGCCAGGCAGTGTGTGTCCCCCGTGTGTTTGGTGGCAGGGGTCCCAGCATCCTAGAG 
TCCAGTCCCCACTCTCACCCTGCATCTCCTGCCCAGGGAACGACACTCATCACCAACCTG 
TCATCGGTGCTGAAGGATGAGGCCGTCTGGGAGAAGCCCTTCCGCTTCCACCCCGAACAC 
(nt 5237-5927 of SEQ ID NO: 3) 

67 33 TGAGACGGGTACGTTGAGGCTGAGCAGATGTCAGTTACCCTTGCCCATAATCCCATGTCC 
CCCACTGACCCAACTCTGACTGCCCAGATTGGTGACAAGGACTACATTGTCCTGGCATGT 
GGGGAAGGGGCCAGAATGGGCTGACTAGAGGTGTCAGTCAGCCCTGGATGTGGTGGAGAG 
GGCAGGACTCAGCCTGGAGGCCCATATTTCAGGCCTAACTCAGCCCACCCCACATCAGGG 
ACAGCAGTCCTGCCAGCACCATCACAACAGTCACCTCCCTTCATATATGACACCCCAAAA 
[T,C] 

GGAAGACAAATCATGTCAGGGAGCTATATGCCAGGGCTACCTCCCAGGGCTCAGTCGGCA 
GGTGCCAGAACATTCCCTGGGAAGGCCCCAGGAAAACCCAGGACCGAGCCACCGCCCTCA 
GCCTGTCACCTTGTGTCCAAAATTGGTGGGTTCTTGGTCTCACTGACTTCAAGAATGAAG 
CCGTGGACCCTCACGGTGAGTGTTACAGTTCTTAAAGATGGTGTGTTCAGAGTTTGTTCC 
TTCTGATGTTAAGACGTGTTCAGAGTTTCTTCCTTCTGGTGGGTGCGTGGTCTTGCTGGC 
(nt 6433-7033 of SEQ ID NO: 3) 

7788 TCCC AGTCCCC ACT AGATT AGCT AGATAGAGTAGAC AG AGAGCACTGATTGGTGCGTTT A 

CAAACCTTGAGTTAGACACAGGGTGCTGACTGGTGTGTTTACAAACCTTGAGCTAGACAC 
AGAGTGCTGATTGGTGTATTTACAATCTTTTAGCTAGAAATAAAGGTTCCCCAAGTCCCC 
ACCAGATTAGCTAGATAGAGTGCTAATTGGTGCATGCACGAACCCGGAGCTAGACACAGA 
GTGCTGATTGGTGCATATACAATCCTCTGGCTAGACATAAAAGTTCTCCAAGTCCCCACC 
[-,C,T] 

GACTCAGGAGCCCAGCCAGCTTCGCCTAGTGGATCCTATGCCAGGGCCACAGGCAGAGCT 
GCCTGCTAGTCCCACACCGGGCACCTGTACTCCTCAGCCCTTGGGCAGTGGACGGGACCA 
GGTGCCGTGGAGCAGTGGGAGGCACCCATCCGGGAGGCTCGGGCCTCGCAGGGAGCCCAC 
CGTAGGGAGGCTTGGGCATGGCAGGCTGCAAGTCCTGAGCCCTGCCCCGCGGGGAGGTGA 
CTGAGGCCTGGCGACAATTCAAGTGTGGTGAGCGCCGGCAGGCCAGCAGTACTGGGGGAC 
(nt 7488-8088 of SEQ ID NO: 3) 

7 867 AGGGTGCTGACTGGTGTGTTTACAAACCTTGAGCTAGACACAGAGTGCTGATTGGTGTAT 
TTACAATCTTTTAGCTAGAAATAAAGGTTCCCCAAGTCCCCACCAGATTAGCTAGATAGA 
GTGCTAATTGGTGCATGCACGAACCCGGAGCTAGACACAGAGTGCTGATTGGTGCATATA 
CAATCCTCTGGCTAGACATAAAAGTTCTCCAAGTCCCCACCTGACTCAGGAGCCCAGCCA 
GCTTCGCCTAGTGGATCCTATGCCAGGGCCACAGGCAGAGCTGCCTGCTAGTCCCACACC 
[G, A] 

GGCACCTGTACTCCTCAGCCCTTGGGCAGTGGACGGGACCAGGTGCCGTGGAGCAGTGGG 
AGGCACCCATCCGGGAGGCTCGGGCCTCGCAGGGAGCCCACCGTAGGGAGGCTTGGGCAT 
GGCAGGCTGCAAGTCCTGAGCCCTGCCCCGCGGGGAGGTGACTGAGGCCTGGCGACAATT 
CAAGTGTGGTGAGCGCCGGCAGGCCAGCAGTACTGGGGGACCCGGTGCCCCCTCTGCAGC 
TGCTGGCCCAGGTGCTAAGCCCCTCACTGCCTGGGGCCAGAGGCACCAGCCGGCCGCTCC 
(nt 7567-8167 of SEQ ID NO: 3) 

794 8 T AAAGGTTCCCCAAGTCCCCACCAGATTAGCTAGATAGAGTGCTAATTGGTGCATGCACG 

AACCCGGAGCTAGACACAGAGTGCTGATTGGTGCATATACAATCCTCTGGCTAGACATAA 
AAGTTCTCCAAGTCCCCACCTGACTCAGGAGCCCAGCCAGCTTCGCCTAGTGGATCCTAT 
GCCAGGGCCACAGGCAGAGCTGCCTGCTAGTCCCACACCGGGCACCTGTACTCCTCAGCC 
CTTGGGCAGTGGACGGGACCAGGTGCCGTGGAGCAGTGGGAGGCACCCATCCGGGAGGCT 
[C,T] 

GGGCCTCGCAGGGAGCCCACCGTAGGGAGGCTTGGGCATGGCAGGCTGCAAGTCCTGAGC 
CCTGCCCCGCGGGGAGGTGACTGAGGCCTGGCGACAATTCAAGTGTGGTGAGCGCCGGCA 
GGCCAGCAGTACTGGGGGACCCGGTGCCCCCTCTGCAGCTGCTGGCCCAGGTGCTAAGCC 
CCTCACTGCCTGGGGCCAGAGGCACCAGCCGGCCGCTCCGAGTGCAGGGCCCGCTGAGCC 
CCTGCCCACCCAGAACTGGTGCTGGCCCGCGAGCAACCCAGGTTCCCGCACACGCCTCTC 
(nt 7648-8248 of SEQ ID NO: 3) 

Chromosome mapping: 

Chromosome #22 
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